S4: structure-based sequence alignments of SCOP superfamilies
نویسندگان
چکیده
S4 is an automatically generated database of multiple structure-based sequence alignments of protein superfamilies in the SCOP database. All structural domains that do not share more than 40% sequence identity as defined by the ASTRAL compendium of protein structures are included. The alignments are constructed using pairwise structural alignments to generate residue equivalences that are then integrated into multiple alignments using sequence alignment tools. We describe the database and give examples showing how the automatically generated S4 alignments compare favourably to hand-crafted alignments. Available at: http://compbio.mds.qmw.ac.uk/S4.html.
منابع مشابه
PASS2 version 4: An update to the database of structure-based sequence alignments of structural domain superfamilies
Accurate structure-based sequence alignments of distantly related proteins are crucial in gaining insight about protein domains that belong to a superfamily. The PASS2 database provides alignments of proteins related at the superfamily level and are characterized by low sequence identity. We thus report an automated, updated version of the superfamily alignment database known as PASS2.4, consis...
متن کاملPASS2: A Database of Structure-Based Sequence Alignments of Protein Structural Domain Superfamilies
Sequence alignments guided by structural features are particularly suited for distant relationships and they permit a better sampling of the protein sequence space. Reliable sequence alignments could be useful in evolutionary biology and in defining structurefunction relationships for protein superfamilies. PASS2 database presents structure-based alignments of protein domains related at the sup...
متن کاملLenVarDB: database of length-variant protein domains
Protein domains are functionally and structurally independent modules, which add to the functional variety of proteins. This array of functional diversity has been enabled by evolutionary changes, such as amino acid substitutions or insertions or deletions, occurring in these protein domains. Length variations (indels) can introduce changes at structural, functional and interaction levels. LenV...
متن کاملAutoSCOP: automated prediction of SCOP classifications using unique pattern-class mappings
MOTIVATION The sequence patterns contained in the available motif and hidden Markov model (HMM) databases are a valuable source of information for protein sequence annotation. For structure prediction and fold recognition purposes, we computed mappings from such pattern databases to the protein domain hierarchy given by the ASTRAL compendium and applied them to the prediction of SCOP classifica...
متن کاملDiscrimination between distant homologs and structural analogs: lessons from manually constructed, reliable data sets.
A natural way to study protein sequence, structure, and function is to put them in the context of evolution. Homologs inherit similarities from their common ancestor, while analogs converge to similar structures due to a limited number of energetically favorable ways to pack secondary structural elements. Using novel strategies, we previously assembled two reliable databases of homologs and ana...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Nucleic Acids Research
دوره 33 شماره
صفحات -
تاریخ انتشار 2005